12 research outputs found

A Bloom filter based semi-index on $q$-grams

Author: Grabowski Szymon
Raniszewski Marcin
Susik Robert
Publication date: 10/07/2015

We present a simple $q$-gram based semi-index, which makes it possible to look for a pattern typically in only a small fraction of text blocks. Several space-time tradeoffs are presented. Experiments on Pizza & Chili datasets show that our solution is up to three orders of magnitude faster than the Claude et al. \cite{CNPSTjda10} semi-index at a comparable space usage.
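
The idea of filtering text blocks by their q-grams can be sketched as follows. This is a minimal illustration, not the authors' implementation: the names `BloomFilter`, `build_semi_index`, `search`, and the parameters `block_size`, `max_pattern_len`, `num_bits`, and `num_hashes` are all assumptions made for the example. Each block gets a Bloom filter holding the q-grams of the block plus a short overlap into the next block, and a block is scanned only if every q-gram of the pattern passes its filter.

```python
import hashlib


class BloomFilter:
    """Plain Bloom filter over byte strings (illustrative, not tuned)."""

    def __init__(self, num_bits, num_hashes):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, item):
        # Double hashing: derive all bit positions from one SHA-256 digest.
        digest = hashlib.sha256(item).digest()
        h1 = int.from_bytes(digest[:8], "little")
        h2 = int.from_bytes(digest[8:16], "little") | 1
        return [(h1 + i * h2) % self.num_bits for i in range(self.num_hashes)]

    def add(self, item):
        for p in self._positions(item):
            self.bits[p >> 3] |= 1 << (p & 7)

    def __contains__(self, item):
        return all(self.bits[p >> 3] & (1 << (p & 7)) for p in self._positions(item))


def qgrams(data, q):
    """Set of all length-q substrings of data."""
    return {data[i:i + q] for i in range(len(data) - q + 1)}


def build_semi_index(text, block_size, q, max_pattern_len):
    """One filter per block; each covers the block plus max_pattern_len - 1 bytes,
    so any occurrence that starts in the block lies fully inside the covered window."""
    filters = []
    for start in range(0, len(text), block_size):
        window = text[start:start + block_size + max_pattern_len - 1]
        bf = BloomFilter(num_bits=8 * block_size, num_hashes=3)
        for gram in qgrams(window, q):
            bf.add(gram)
        filters.append(bf)
    return filters


def search(text, filters, block_size, q, max_pattern_len, pattern):
    """Return start positions of pattern, scanning only blocks that pass the filter."""
    if not q <= len(pattern) <= max_pattern_len:
        raise ValueError("pattern length must be between q and max_pattern_len")
    grams = qgrams(pattern, q)
    hits = []
    for b, bf in enumerate(filters):
        if not all(gram in bf for gram in grams):
            continue  # some q-gram is definitely absent, so skip this block
        start = b * block_size
        # Occurrences found in this window always start inside block b.
        window = text[start:start + block_size + len(pattern) - 1]
        pos = window.find(pattern)
        while pos != -1:
            hits.append(start + pos)
            pos = window.find(pattern, pos + 1)
    return hits


text = b"abracadabra_" * 50 + b"needle" + b"xyz" * 100
filters = build_semi_index(text, block_size=64, q=3, max_pattern_len=16)
print(search(text, filters, 64, 3, 16, b"needle"))
```

Because a Bloom filter has no false negatives, a skipped block cannot contain the pattern; false positives only cost an unnecessary scan. The sketch still needs the original text for verification.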

arXiv.org e-Print Archive

Optimization of Magnetic Field-Assisted Synthesis of Carbon Nanotubes for Sensing Applications

Author: Kołaciński Zbigniew
Pyć Marcin
Raniszewski Grzegorz
Publication venue: MDPI AG
Publication date: 01/01/2014

One of the most effective ways of synthesizing carbon nanotubes is the arc discharge method. This paper describes a system supported by a magnetic field, which can be generated by an external coil. An electric arc between two electrodes is stabilized by the magnetic field following mass flux stabilization from the anode to the cathode. In this work four constructions are compared. Different configurations of cathode and coils are calculated and presented. Exemplary results are discussed. The paper describes attempts at magnetic field optimization for different configurations of electrodes.

Multidisciplinary Digital Publishing Institute

CiteSeerX

Directory of Open Access Journals

PubMed Central

Lodz University of Technology Repository

Data classification, data set reduction and editing algorithms using the representative measure

Author: Raniszewski Marcin
Publication venue: Lodz University of Technology. Press
Publication date: 01/01/2010

In data classification, decisions are made on the basis of the information the data carry (the data features). Correct and fast classification depends on proper preparation of the data set and on the selection of a suitable classification algorithm. One such algorithm is the popular Nearest Neighbor rule (NN). Its advantages are simplicity, intuitiveness and a wide range of applications. Its disadvantages are large memory requirements and a drop in speed on very large data sets. Reduction algorithms remove a large part of the data set, which significantly speeds up NN, while keeping those elements from which data can still be classified with acceptable quality. Editing algorithms remove redundant and atypical elements from a data set. This paper presents new reduction and editing algorithms, both using the representative measure. Tests were performed on several data sets of different sizes that are well known in the literature. The results are promising. They are compared with the results of other popular reduction and editing algorithms.
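
To illustrate how a reduced data set interacts with the NN rule, here is a minimal sketch. It does not implement the paper's representative-measure algorithms, which the abstract does not define. As a stand-in it uses Hart's condensed nearest neighbour rule, a classic reduction procedure. The function names `nearest_label` and `condense` and the toy data are assumptions made for the example.

```python
import math


def nearest_label(train, x):
    """1-NN rule: label of the training sample closest to x (Euclidean distance)."""
    return min(train, key=lambda sample: math.dist(sample[0], x))[1]


def condense(train):
    """Hart's condensed NN: keep only samples the current subset misclassifies.

    Stand-in reduction algorithm, not the representative-measure method.
    """
    kept = [train[0]]
    changed = True
    while changed:
        changed = False
        for sample in train:
            if sample in kept:
                continue
            if nearest_label(kept, sample[0]) != sample[1]:
                kept.append(sample)
                changed = True
    return kept


# Toy data set: two well-separated classes (hypothetical values).
train = [((0, 0), "A"), ((0, 1), "A"), ((1, 0), "A"), ((1, 1), "A"),
         ((10, 10), "B"), ((10, 11), "B"), ((11, 10), "B"), ((11, 11), "B")]

reduced = condense(train)
print(len(train), len(reduced))
print(nearest_label(reduced, (9, 9)))
```

Each query to the 1-NN rule scans the whole stored set, so classification cost grows with the number of stored samples; classifying against `reduced` instead of `train` is where the speed-up comes from.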

Lodz University of Technology Repository

Lossy image compression using linear approximation (Stratna kompresja obrazu z wykorzystaniem aproksymacji liniowej)

Author: Raniszewski Marcin

Title from header. Bibliography: p. 470. Also available in print. ABSTRACT: This paper presents a lossy image compression algorithm that uses linear approximation. It discusses the compression results for example bitmaps and draws conclusions about the usefulness of the algorithm for certain kinds of images. KEYWORDS: lossy image compression, linear approximation, JPEG, GIF, PNG, ZIP.

Academic Digital Library (Akademickiej Bibliotece Cyfrowej)

Review of algorithmic and engineering problems in a computer-aided translation application handling multi-language DTP documents

Author: Draus Cezary
Grabowski Szymon
Nowak Grzegorz
Raniszewski Marcin
Publication venue: Lodz University of Technology. Press
Publication date: 01/01/2010

We present and discuss a number of problems related to the effective translation of product catalogues and advertising brochures with a CAT (Computer-Aided Translation) application, and document our work on efficient algorithmic solutions. CAT tools usually work on small text phrases (snippets) organized into so-called Translation Memories (TMs). These tools make it possible to navigate freely over the document, automatically translate recognized phrases, suggest translations for phrases similar to ones already in the system, search and update the TMs, and more. The problems we consider can generally be divided into those related to the user interface and those based on text algorithms. In particular, we solved the problems of symbol detection (where "symbols" are sequences of characters that should not be translated for most language pairs, such as numbers, abbreviations of physical units, product codes, reference numbers, registered symbols and trademarks), TM editing, document annotation, translating with gaps, and fuzzy matching. These functionalities speed up the work of a translator (e.g., by minimizing the probability of some classes of errors in the translation process) and make the management and maintenance of the document and TMs easier. In this way, the document release cycle is shortened, which is of utmost importance for DTP documents that require parallel translation into many languages (catalogues, advertising brochures).

Lodz University of Technology Repository